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We present the results of the combination of searches for the standard model Higgs boson 
produced in association with or Z boson and decaying into bb using the data sample collected 
with the DO detector in pp collisions at ^/s = 1.96 TeV at the Fermilab Tevatron Collider. We 
derive 95% C.L. upper limits on the Higgs boson cross section relative to the standard model 
prediction in the mass range 100 GeV < Mr < 150 GeV, and we exclude Higgs bosons with 
masses smaller than 102 GeV at the 95% C.L. In the mass range 120 GeV < Mh < 145 GeV, the 
data exhibit an excess above the background prediction with a global significance of 1.5 standard 
deviations, consistent with the expectation in the presence of a standard model Higgs boson. 
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PACS numbers: 14.80.Bn, 13.85. Ni, 13.85. Qk, 13.85. Rm 



Despite its success as a predictive tool, the standard 
model (SM) of particle physics [ij remains incomplete 
without a means to explain electroweak symmetry break- 
ing. The simplest proposed mechanism 2] involves the 
introduction of a complex doublet of scalar fields that 
generates the masses of elementary particles via their 
mutual interactions. After accounting for longitudinal 
polarizations for the electroweak bosons, this mechanism 
also gives rise to a single scalar boson, the SM Higgs bo- 
son, with an unpredicted mass (Mh)- Direct searches for 
e+e" Z* ^ ZH at the CERN e+e~ Colhder (LEP) 



*with visitors from "Augustana College, Sioux Falls, SD, USA, 
''The University of Liverpool, Liverpool, UK, '^UPIITA-IPN, Mex- 
ico City, Mexico, ''DESY, Hamburg, Germany, '^SLAC, Menlo 
Park, CA, USA, J" University College London, London, UK, ^Centro 
de Investigacion en Computacion - IPN, Mexico City, Mexico, 
''ECFM, Universidad Autonoma de Sinaloa, Culiacan, Mexico and 
' Universidade Estadual Paulista, Sao Paulo, Brazil. 



yielded a lower mass limit of Mh > 114.4 GeV fs*] at 
95% confidence level (C.L.). Precision electroweak mea- 
surements M, including the latest W boson mass mea- 
surements [3, 0] at the Fermilab Tevatron Collider, re- 
sult in an upper 95% C.L. limit of Mh < 152 GeV. 
Direct searches at LEP ^3'|, the Tevatron [7|], and the 
CERN Large Hadron Collider (LHC) i, i] exclude at 
the 95% C.L. most of the allowed mass range, except for 
116.6 GeV < Mh < 119.4 GeV and 122.1 GeV < Mh < 
127.0 GeV. In addition the ATLAS and CMS CoUabo- 
rations have published [sl, fp] excesses above background 
expectations at a mass of w 125 GeV and have recently 
published results [13] confirming these excesses at the 
level of 5 standard deviations (s.d.), driven by searches 
for 77 ^ 77 and ZZ^*'> ~> £+£-£'+£'-, where £ and 

£' denote an electron or muon. These searches primar- 
ily exploit the gluon-gluon fusion production mechanism 
for the Higgs boson, gg — > H, mediated by a top-quark 
loop, while — > 77 searches are also sensitive to vector 



4 



{V = W, Z) boson fusion, qq' — >• Hqq'. In the allowed 
mass range, the Tevatron experiments are particularly 
sensitive to the SM Higgs boson produced in association 
with a vector boson, VH, and the Higgs boson decay- 
ing into bb, the primary decay mode for a Higgs boson 
with Mh < 135 GeV. Searches at both hadron colliders 
have a high degree of complementarity, with the main 
search channels at the LHC being particularly sensitive 
to the Higgs boson mass and couplings to vector bosons, 
while searches at the Tevatron provide information on 
the Higgs boson coupling to b quarks. 

This Letter describes the combination of searches for 
VH, H ^ bb production at the DO experiment using 
the sample of pp collision data at i/s = 1.96 TeV col- 
lected during Run II of the Fermilab Tevatron Collider. 
These searches are focused on leptonic W and Z boson 
decays that allow us to efficiently suppress the large mul- 
tijet background present at a hadron collider and are re- 
stricted to the mass range 100 GeV < Mh < 150 GeV. 
Therefore, the signal processes being targeted are WH 
eubb ZH ^ vvbb [12], and ZH -> l+£-bb [H. A 



similar combination of searches in the H ^ bb decay 
mode has recently been reported by the CDF Collabora- 
tion [Hi and previously by the ATLAS [ll], CMS 0, 
and LEP [i| Cohaborations. 

The DO detector is described elsewhere 
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on the reconstruction and identification criteria for the 
physics objects used in these searches [electrons, muons, 
jets, and missing transverse energy [Ifirp )] can be found 
elsewhere [lll - lisl . Jets are identified as consistent 

with the fragmentation of a 6 quark (6-tagged) by a mul- 
tivariate algorithm [loj combining information from the 
impact parameter of displaced tracks and the topological 
properties of secondary vertices reconstructed in the jet. 

The main backgrounds affecting these searches orig- 
inate from VF/Z-|-heavy-flavor jets (jets initiated by b 
and c quarks) and from top-quark pair (tf) production. 
Smaller contributions arise from M^/Z-l-light-flavor jets, 
single top-quark, diboson {WW,WZ, ZZ), and multijet 
production. Multijet events contribute to the selected 
samples via the misidentification of a jet or a photon as 
an electron, the presence of a non-prompt lepton from 
a semileptonic b- or c-hadron decay {WH — >■ ii/bb and 
ZH — > i^i'bb analyses), or jet energy mismeasurements 
resulting in apparent large ]pj,[ZH — vvbb analysis). 
In all instances, the normalization and kinematic distri- 
butions of multijet events are estimated via data-driven 
methods. The remaining backgrounds, as well as the sig- 
nal, are estimated with Monte Carlo simulation. Samples 
of ly/Z-f-jets and tt events are generated by using the 
ALPGEN [20| tree-level matrix element generator, while 
samples of single top-quark and diboson events are gen- 
erated by using the SingleTop jllj and pythia [2^ 
leading-order (LO) generators, respectively. These sam- 
ples are normalized to next-to-next-to-LO (NNLO) [1^, 
approximate NNLO [H, [iil, and next-to-LO [2^ theo- 



retical cross sections. Samples of WH and ZH signal 
events are generated by using the pythia generator for a 
range of masses, 100 GeV < Mh < 150 GeV, in steps of 
5 GeV and are normalized to the most recent theoretical 
predictions 27-2§|. All Monte Carlo samples are gener- 
ated by using the CTEQ6L1 PDF set [13] and processed 
through PYTHIA to model parton showering and fragmen- 
tation. Signal and backgrounds samples are processed 
by a GEANTS-based [sH simulation of the DO detector 
and reconstructed by using the same algorithms applied 
to the collider data. Simulated events are corrected so 
that the object identification efficiencies, energy scales, 
and energy resolutions match those determined in data 
control samples. More details on the simulation and nor- 
malization of th e signal and background samples can be 



found elsewhere [lll-ll3[ 



In the case of the ZH vvbb analysis, the data were 
collected by using triggers requiring jets plus and cor- 
respond to an integrated luminosity of 9.5 fb~^ [s^]. The 
ZH £~^£~bb and WH — >■ £vbb analyses use a logical OR 
of triggers dominated by single lepton, dilepton, lepton- 
plus-jets, and jet-plus- -^^^ triggers, resulting in an inte- 
grated luminosity of 9.7 fb~^. The analyses select non- 
overlapping subsets of data via different requirements on 
lepton multiplicity: (i) exactly two opposite-charge lep- 
tons {ZH — £+^^66), (ii) exactly one charged lepton and 
large ]f!j,{WH — >■ ivbb), and (iii) exactly zero charged 
leptons and large l^j, {ZH — >■ vvbb). A significant frac- 
tion of signal events selected by the ZH — > vvbb analysis 
originate from WH production, where the charged lep- 
ton is not reconstructed. In addition, events are required 
to have two or three reconstructed jets, with the excep- 
tion of the ZH vvbb analysis, which is restricted to 
events with exactly two jets. The signal-to-background 
ratio is significantly enhanced by requiring one or two 
6-tagged jets in an event. The sensitivity of the searches 
is maximized by categorizing events into different anal- 
ysis subchannels depending on the flavor and quality of 
the charged leptons, jet multiplicity, 6-tagged jet multi- 
plicity, and 5-tagged jet quality. The primary discrim- 
inating variable between the VH signal and the back- 
grounds is the dijet invariant mass, for which the signal 
shows a distinct resonant structure; however, by com- 
bining this variable with several other kinematic vari- 
ables via a multivariate approach, the sensitivity of the 
searches is improved by approximately 25%. Therefore, 
the final observable for each of the subchannels in the 
different searches is a one-dimensional multivariate dis- 
criminant optimized for each hypothesized Mh value. 

We in terp ret the result of the searches via the CLg 
method [Ij, HI], which employs a log-likelihood ratio 
LLR = — 2 ln(Ls+b/ib) as a test statistic, where Ls+b 
(Lb) is a Poisson hkelihood to observe the data under 
the signal-plus-background (background-only) hypothe- 
sis. Separate channels are combined by summing LLR 
values over all bins, thus maintaining the individual chan- 
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nel sensitivities. The per-bin signal and background pre- 
dictions are parameterized in terms of nuisance parame- 
ters that describe the effect of systematic uncertainties. 
The impact of systematic uncertainties on the search sen- 
sitivity is reduced by maximizing both hkehhood func- 
tions ig+b and ibj with respect to these nuisance pa- 
rameters, subject to Gaussian constraints of their prior 
values. CLg is defined as the ratio of the confidence levels 
for the signal-plus-background (CLg+b) and background- 
only (CLb) hypotheses, which are each evaluated by in- 
tegrating the corresponding LLR distributions populated 
by simulating outcomes via Poisson statistics. System- 
atic uncertainties are incorporated via Gaussian fluctua- 
tions on the expected number of signal and background 
events per bin, taking into account correlations across 
processes and channels |35i]. Signal cross sections result- 
ing in CLg < 0.05 are excluded at the 95% C.L. 

The systematic uncertainties differ between analyses, 
but we summarize here the largest contributions. We 
account for the impact of these uncertainties both on 
the integrated signal and background yields and on the 
shapes of the flnal discriminants where relevant. The 
ZH — >■ vvhh and WH — )■ Ivhh analyses carry a correlated 
uncertainty on the integrated luminosity of 6.1% [s^ . 
The ZH — )■ £~^i~bb analysis normalizes the predictions 
using the peak from Z — )■ £~^£~ decays from data and the 
corresponding NNLO cross section [i^l- The 6-tagging 
efficiency has an uncertainty of «1%-15%, depending on 
the sample and 6-tagging criteria. The uncertainty due 
to acceptance and energy measurement of jets is typically 
around 7%. Uncertainties due to acceptance and energy 
measurement of leptons range from 1% to 9%, depend- 
ing on the final state. A significant source of uncertainty 
comes from the T^-|-jets background cross sections, which 
have uncertainties of 4%-10% for light flavor jets and 
w 22% for heavy flavor jets. These account for both 
the uncertainty on the theoretical cross section calcula- 
tions and the uncertainties on the higher-order correction 
factors. The uncertainty on the expected multijet back- 
ground is dominated by the statistics of the data sample 
from which it is estimated, and is considered separately 
from the other cross section uncertainties. All analy- 
ses take into account the uncertainties on the theoreti- 
cal production cross sections for the different signal pro- 
cesses due to PDF and scale choice. In addition, analy- 
ses incorporate differential uncertainties on the dominant 
backgrounds to allow for potential variations of the final 
discriminants due to generator and background modeling 
uncertainties. The total impact of systematic uncertain- 
ties on the combined sensitivity is « 20%. 

To confirm the ability of these analyses to measure a 
signal and to validate the background modeling, we per- 
form a measurement of the VZ production cross section 
in the same final states. The only difference from the 
Higgs boson search is to use SM WZ and ZZ production 
as the signal instead of WH and ZH, while the rest of the 
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FIG. 1: (color online). Background-subtracted data distribu- 
tions of logj^o(s/6) in (a) the VZ analysis after a fit of the VZ 
and background contributions to the data and (b) the VH, 
H bb search for Mr = 125 GeV after a fit of the back- 
grounds to the data. The background-subtracted data are 
shown as points and the signal is shown as the red histogram 
in each plot. The blue lines indicate the posterior uncertainty 
on the background prediction. 



SM processes, including WW production, are treated as 
backgrounds. Multivariate discriminants using the same 
input variables as in the Higgs boson searches are trained 
to separate the VZ signal from the backgrounds and the 
resulting distributions are fit to determine the VZ cross 
section. The combination of all three analyses yields 
a{VZ) = 3.3 ± 1.4 pb, consistent with the SM prediction 
of 4.4 ±0.3 pb [2^. The observed (expected) significance 
of the measured excess is 2.5 (3.4) s.d. 

The statistical analysis makes use of simultaneous fits 
to the individual final discriminants, but it is useful for 
presentation purposes to collect all of the inputs into a 
single distribution. This is done by reordering the bins 
from the input distributions according to their signal- 
to-background ratios (s/b), so that bins with similar 
logiQ{s/b) are combined. Figure [T] shows this distribu- 
tion for the VZ cross section measurement and for the 
Higgs boson search with Mh = 125 GeV after subtracting 
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FIG. 2: (color online), (a) The 95% C.L. cross section upper limit ratios versus Mh, and (b) LLR distribution versus Mh, for 
the combined VH, H ^ bb analyses. The solid lines represent the observed values in the data. The short-dashed black (red) 
lines represent the median expected values under the background-only (signal-plus-background) hypothesis at each mass. The 
long-dashed blue lines show the expected outcome from injecting a SM Higgs boson signal with Mh ~ 125 GeV. The green and 
yellow shaded bands correspond to the regions enclosing 1 and 2 s.d. variations about the median expected values under the 
background-only hypothesis, respectively. 



TABLE 1: Expected (median) and observed 95% C.L. cross section upper limit ratios for the combined VH, H bb analyses 
over the 100 GeV < Mh < 150 GeV mass range. 



Mh (GeV) 


100 


105 


110 


115 


120 


125 


130 


135 


140 


145 


150 


Expected: 


1.2 


1.3 


1.4 


1.6 


1.9 


2.3 


2.9 


3.8 


5.3 


7.8 


12 


Observed: 


0.94 


1.1 


1.2 


1.9 


2.6 


3.2 


4.3 


6.5 


8.0 


12 


14 



the expected background from the data. The subtracted 
background corresponds to the maximum-likelihood fit of 
the nuisance parameters to the data, and the posterior 
uncertainty from that fit is also shown in the plot. 

We derive limits on SM Higgs boson production 
a{VH) X BR(iJ — >66) for Higgs boson masses in the range 
100 GeV < Mh < 150 GeV in steps of 5 GeV. We assume 
the relative contributions of the different production and 
decay modes as given by the SM prediction. We present 
our results in terms of the ratio of 95% C.L. upper cross 
section limits to the SM predicted cross section. The 
SM prediction for Higgs boson production would there- 
fore be considered excluded at 95% C.L. when this limit 
ratio falls below unity. Figure [H^a) shows the combined 
expected and observed 95% C.L. cross section limits as 
a ratio to the SM cross section as a function of Mh- 
These results are also summarized in Table ID The LLR 
distributions for the combination are shown in Fig.[2][b). 
Although consistent with the background-only hypoth- 
esis for Mh < 115 GeV, the observed LLR exhibits a 
signal-like excess at the level of 1-1.7 s.d. for the mass 
range 120 GeV < Mh < 145 GeV 

To understand the compatibility of this excess with the 
hypothesis of a SM Higgs boson, we obtain the best-fit 
cross section for the Higgs boson signal relative to the 
SM prediction {R^*") as a function of Mh- This value is 



obtained by performing a maximum-likelihood fit over all 
search channels simultaneously, allowing the fit to vary 
all nuisance parameters within their priors and with the 
Higgs boson cross section as a free parameter. Figure [3] 
shows the measured a{VH) x BR(iJ — > bb) as a function 
of Mh, including its ±1 s.d. uncertainty band, and com- 
pared with the SM prediction. At a mass of 125 GeV, 
the best-fit cross section is a{VH) x BR{H — > bb) — 
140li3o which is 1-2^1'^^ times the SM prediction. 

The significance of the data excess above the back- 
ground prediction is estimated by computing the p value 
under the background-only hypothesis using R^^ as the 
test statistic for each value of Mh- This p value rep- 
resents the probability to have a value of R^^ as large 
or larger than that observed in the data due to a back- 
ground fiuctuation. The smallest p value is obtained at 
a mass of 135 GeV and corresponds to a significance of 
1.7 s.d. above the background-only prediction. This sig- 
nificance does not take into account the look-elsewhere- 
effect [3^, which accounts for the possibility of a back- 
ground fluctuation in the local p value anywhere in the 
tested mass range. By taking into account existing limits 
on Mh in the bb decay mode 0, the search region be- 
comes 115 GeV < Mh < 150 GeV. Given the expected 
mass resolution of these searches of « 16%, this trans- 
lates into a look-elsewhere-effect factor of « 1.6 for a 
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FIG. 3: (color online). The best-fit value for a{VH) x 
BR(// -> bb) as a function of Mh- The green shaded band 
corresponds to the 1 s.d. uncertainty around the best-fit cross 
section. Also shown is the SM prediction including the theo- 
retical uncertainties. 



global significance of 1.5 s.d. Also taking into account the 
existing SM Higgs boson exclusions from the LHC [1, 0] 
experiments, there is no look-elsewhere-effect and we find 
an excess at Mh — 125 GeV with a significance of 1.1 s.d. 

In summary, we have presented a combination of 
searches for the SM Higgs boson produced in association 
with a vector boson and decaying into bb, using the data 
sample collected with the DO detector in Run II of the 
Fermilab Tevatron Collider. We achieve a sensitivity that 
is competitive with other searches in this final state 
16|, deriving 95% C.L. upper limits on the Higgs boson 
cross section relative to the SM prediction in the mass 
range 100 GeV < Mh < 150 GeV and excluding Higgs 
bosons with masses smaller than 102 GeV at the 95% 
C.L. In the mass range 120 GeV < Mh < 145 GeV, the 
data exhibit an excess above the background prediction 
with a global significance of 1.5 s.d. and a magnitude 
consistent with that expected for the SM Higgs boson. 
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